2 |
On the Evolution of Syntactic Information Encoded by BERT's Contextualized Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
How much pretraining data do language models need to learn syntax? ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Assessing the Syntactic Capabilities of Transformer-based Multilingual Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Assessing the Syntactic Capabilities of Transformer-based Multilingual Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Semantically-oriented text planning for automatic summarization
|
|
|
|
In: TDX (Tesis Doctorals en Xarxa) (2021)
|
|
BASE
|
|
Show details
|
|
7 |
The Third Multilingual Surface Realisation Shared Task (SR'20): Overview and Evaluation Results ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
The Third Multilingual Surface Realisation Shared Task (SR'20): Overview and Evaluation Results ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
CollFrEn: Rich Bilingual English--French Collocation Resource ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
The second multilingual surface realisation shared task (SR'19): Overview and evaluation results
|
|
|
|
In: Mille, Simon orcid:0000-0002-8852-2764 , Anja, Belz, Bohnet, Bernd, Graham, Yvette and Wanner, Leo orcid:0000-0002-9446-3748 (2019) The second multilingual surface realisation shared task (SR'19): Overview and evaluation results. In: 2nd Workshop on Multilingual Surface Realisation (MSR 2019), 3 Nov 2019, Hong Kong, China. (2019)
|
|
BASE
|
|
Show details
|
|
12 |
Collocation classification with unsupervised relation vectors
|
|
|
|
BASE
|
|
Show details
|
|
13 |
A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content
|
|
|
|
In: Front Robot AI (2018)
|
|
BASE
|
|
Show details
|
|
14 |
Towards Distributional Semantics-Based Classification of Collocations for Collocation Dictionaries
|
|
|
|
In: International Journal of Lexicography 30 (2017) 2, 167-186
|
|
IDS OBELEX meta
|
|
Show details
|
|
15 |
Multilingual Surface Realization Using Universal Dependency Trees
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Feature engineering for author profiling and identification: on the relevance of syntax and discourse
|
|
|
|
In: TDX (Tesis Doctorals en Xarxa) (2017)
|
|
Abstract:
Author profiling and identification are two areas of data-driven computational linguistics that have gained a lot of relevance due to their potential applications in, e.g., forensic linguistic studies, marketing analysis, and historic/literary authorship verification. Author profiling aims to identify demographic traits of the authors, while author identification aims to identify the authors themselves by searching for distinctive linguistic patterns that distinguish them. The majority of approaches in the related work tends to focus on the content of the texts. We argue that focusing on structure rather than content can be more effective. The main focus of the thesis is thus on feature engineering, the development, evaluation and application of the feature set in the context of machine learning techniques to author profiling and identification. We prove the profiling potential of syntactic and iscourse features, which achieve state-of-the-art performance in many different scenarios, especially when combined with other features. ; El perfilament i la identificació d’autors són camps de la lingüística computacional que han guanyat rellevància als últims anys gràcies a les seves potencials aplicacions al camp de la lingüística forense o a la verificació d’autoria de textos històrics. El perfilament d’autors té com a objectiu identificar trets demogràfics dels autors; la identificació d’autors tracta d’identificar l’autor del text. Per fer-ho, es busquen automàticament patrons lingüístics per diferenciar entre autors/trets demogràfics. La majoria de treballs anteriors, es centren en el contingut dels texts. Nosaltres argumentem que analitzar l’estructura del text pot ser una alternativa més efectiva. El focus d’aquesta tesi està per tant, al feature engineering: la extracció avaluació i utilització d’un conjunt de característiques lingüístiques amb algoritmes d’aprenentatge automàtic per a perfilar/identificar autors. Demostrem que les característiques sintàctiques i discursives són rellevants i que combinades amb altres, obtenen resultats a l’altura de l’estat de l’art.
|
|
Keyword:
62; Aprenentatge automàtic; Author identification; Author profiling; Classificació de text; Discourse; Discurs; Estilometría; Feature engineering; Gender identification; Identificació d'autors; Identificació de gènere; Machine learning; Natural language processing; Perfilament d'autors; Processat del llenguatge; Sintaxis; Stylometry; Syntax; Text classification
|
|
URL: http://hdl.handle.net/10803/404984
|
|
BASE
|
|
Hide details
|
|
17 |
Processament automàtic de patents: un exercici de terminologia computacional
|
|
|
|
In: Terminàlia; Núm. 16 : desembre 2017; p. 54-56 ; 2013-6692 (2017)
|
|
BASE
|
|
Show details
|
|
18 |
Combining Acoustic and Linguistic Features in Phrase-Oriented Prosody Prediction ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Classification of Grammatical Collocation Errors in the Writings of Learners of Spanish ; Clasificación de errores gramaticales colocacionales en textos de estudiantes de español
|
|
|
|
BASE
|
|
Show details
|
|
|
|